CQFD - 2012 - Annual activity report


Section: New Results

Singularly Perturbed Discounted Markov Control Processes in a General State Space

Participant: François Dufour.

Markov decision processes, optimal control, infinite discounted expected cost, singular perturbation

This work studies the asymptotic optimality of discrete-time Markov decision processes (MDPs) with general state and action spaces that exhibit both weak and strong interactions. The idea is to reduce the dimension of the state space by considering an averaged model. This setting is often described by introducing a small parameter $ϵ > 0$ in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. Our objective is twofold. First, we show that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as $ϵ$ goes to zero. Second, we prove that a feedback control policy for the original control problem, constructed from an optimal feedback policy for the limit problem, is asymptotically optimal. Our work extends existing results in the literature in two directions: the underlying MDP is defined on general state and action spaces, and we do not impose strong conditions on the recurrence structure of the MDP, such as Doeblin's condition.
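The two results can be written schematically as follows. The notation is an assumption introduced here for illustration and is not taken from the paper: $J^{\epsilon}(\pi, x)$ stands for the infinite-horizon discounted expected cost of a policy $\pi$ started at state $x$ under the perturbed model, $V^{\epsilon}$ for the corresponding value function, $\bar{V}$ for the value function of the limit averaged problem, $\bar{x}$ for the averaged state associated with $x$, and $\pi^{\epsilon}$ for the feedback policy built from an optimal feedback policy of the limit problem. The mode of convergence and the precise correspondence between $x$ and $\bar{x}$ are left unspecified.

```latex
\begin{align*}
  % value function of the perturbed control problem
  V^{\epsilon}(x) &= \inf_{\pi} J^{\epsilon}(\pi, x), \\
  % first result: convergence to the value function of the averaged problem
  \lim_{\epsilon \to 0} V^{\epsilon}(x) &= \bar{V}(\bar{x}), \\
  % second result: asymptotic optimality of the policy built from the limit problem
  \lim_{\epsilon \to 0} \bigl[ J^{\epsilon}(\pi^{\epsilon}, x) - V^{\epsilon}(x) \bigr] &= 0.
\end{align*}
```

Read this way, the second line says that the averaged problem approximates the optimal cost of the original one, and the third says that the policy inherited from the averaged problem loses a vanishing amount of performance as $\epsilon$ goes to zero.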

These results have been obtained in collaboration with Oswaldo Luis Do Valle Costa from Escola Politécnica da Universidade de São Paulo, Brazil.

This work has been published in the SIAM Journal on Control and Optimization [16].
